K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 86 | 98 | 97 | 97 | 99 |
1000 | 621 | 860 | 953 | 979 | 995 |
10000 | 3280 | 6913 | 8749 | 9613 | 9897 |
100000 | 11489 | 46104 | 75376 | 90528 | 96289 |
1000000 | 11489 | 46104 | 75376 | 90528 | 96289 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings